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Abstract — Spatial coupling has recently emerged as a powerful 
paradigm to construct graphical models that work well un- 
der low-complexity message-passing algorithms. Although much 
progress has been made on the analysis of spatially coupled mod- 
els under message passing, there is still room for improvement, 
both in terms of simplifying some existing proofs as well as in 
terms of proving some additional properties. 

We introduce one further tool for the analysis, namely the 
concept of displacement convexity. This concept plays a crucial 
role in the theory of optimal transport and it is also very well 
suited for the analysis of spatially coupled systems. In cases where 
the concept applies, displacement convexity allows functionals 
of distributions which are not convex to be represented in an 
alternative form, so that they are convex with respect to the 
new parameterization. The additional convex structure can then 
often be used to prove the uniqueness of the minimizer of this 
functional. As a proof of concept we consider spatially coupled 
(l,r) -regular Gallager ensembles when transmission takes place 
over the binary erasure channel. In particular, we prove that 
the potential functional governing this system admits a unique 
optimal "profile" which characterizes the "decoding wave" of the 
spatially coupled system using the tool of displacement convexity. 

I. Introduction 

Spatially coupled codes were introduced in the form of 
low-density parity-check codes by Felstrom and Zigangirov in 
HI. Such codes are constructed by spatially coupling nearby 
replicas of a code defined on a graph. It has been proven 
that such ensembles perform very well under low-complexity 
message-passing algorithms. Indeed, this combination achieves 
essentially optimal performance. More generally, the concept 
of spatial coupling is proving to be very useful not only for 
coding but also in statistical physics and compressive sensing. 
Given this range of applications, it is worth investigating basic 
properties of this construction in generality. Our aim is to 
introduce one further tool for the analysis of such systems - 
namely the concept of displacement convexity. Displacement 
convexity plays a crucial role in the theory of optimal transport. 
But it is also very well suited as a tool for the analysis of 
spatially coupled graphical models. 

Using the simple case of regular LDPC ensembles and 
transmission over the binary erasure channel (BEC), we ex- 
plain how the concept of displacement convexity can help 
to simplify some existing proofs and to derive new results. 
The actual range of applications which can benefit from 
this concept is considerably larger and we pose some open 
problems in this respect as further research directions. 

One of the most important properties of spatially coupled 



codes is that they exhibit the so-called threshold saturation 
phenomenon. That is, spatially coupled ensembles genetically 
have a BP threshold which is as large as the maximum-a 
posteriori (MAP) threshold of the underlying ensemble, i.e., 
their threshold has saturated to the largest possible value. This 
result has been proved for transmission over the BEC in [0, 
0, and lID and for transmission over general binary-input 
memoryless output-symmetric (BMS) channels in |5| and [6|. 

Our goal is to provide a new proof technique that targets 
the same problem. The tool we introduce is applicable to 
systems which are governed by a variational principle; we 
use (7,r)-regular Gallager ensembles as a proof of concept. 
We run the belief-propagation algorithm on this ensemble and 
express the potential functional using the results in [3|. The 
potential functional is the analog of the Bethe free energy, 
and the density evolution (DE) equations can be obtained by 
differentiating this potential. Our goal is to find the fixed-point 
(FP) solution(s) of the DE equations, which is equivalent to 
finding the minimizer(s) of the potential. 

For the simple case we analyze, we use displacement 
convexity [7| to prove that the potential describing the system 
is convex with respect to an alternative structure of probability 
measures. We consider the static case, when the decoder phase 
transition threshold is equal to the maximum a posteriori 
(MAP) threshold. Using this alternative convex structure, we 
then prove that the potential of the ensemble admits a unique 
minimizing profile. 

The paper is organized as follows: Section|II]introduces the 
framework for our analysis and our main results. We then give 
a quick introduction of the notion of displacement convexity in 
Section|III] Finally, Section|IV]presents a proof of existence of 
the profile that minimizes the potential, and Section IV1 proves 
that this minimizer is unique. 

II. Setting and Main Results 

In this section, we introduce the model and the associated 
variational problem to which we apply the displacement con- 
vexity proof technique, and we state our main result. 

A. (Z, r, L, w)-Regular Ensembles on the BEC 

Consider the spatially coupled (I, r, L, w)-regular ensem- 
ble, described in detail in [5|, where the parameters represent 
the left degree, right degree, system length parameter, and 
coupling window size (or smoothing parameter), respectively. 
Specifically, the ensemble is constructed as follows: consider 
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Fig. 1 . The plot of the single system potential W s (p) as a function of the 
check-erasure probability p, for a (3, 6) uncoupled ensemble and e = £map- 
There are two minima at p = and p = pmap • 



2L + 1 replicas of a protograph of an (I, r)-regular ensemble. 
We couple these components by connecting every variable 
node to I check nodes, and every check node to r variable 
nodes. The connections are chosen randomly: for a variable 
node at position z, each of its I connections is chosen uni- 
formly and independently in the range [z, . . . , z + w — 1], and 
for a check node at position z, each of its r connections is cho- 
sen uniformly and independently in the range [z—w+1, . . . ,z}. 

For the channel we take a BEC with parameter e. If 
x z ; z e [—L,...,L] denotes the erasure probability of the 
variable node at position z, the fixed-point (FP) condition 
implied by density evolution is 
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This FP condition can be obtained by minimizing a "potential 
functional". It can be shown using the results in [3 1 that the 
potential is 
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At this point the normalization 1/w is a convenience whose 
reason will immediately appear. 

The natural setting for displacement convexity is the con- 
tinuum case. We will therefore consider the continuum limit of 
(QJ. Extending our results to the discrete setting is one among 
various open problems to which we briefly come back in the 
conclusion. We define the rescaled variables z — — , u = — 
and the rescaled function x(-) = x z . It is easy to see that (Q} 
becomes a Riemann sum. When we take the limit L — > +oo 
first and then w — > +oo, we find 



dz J - s(2)(l - x^y- 1 - 1(1 - x(2)) r 



(2) 
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du[l - (1 -x{z + u)) 
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At this point the reader might wonder if the integrals converge. 
As explained in the introduction, we look in this paper at the 
decoder phase transition threshold e = e MAP . We give at the 
end of this paragraph the conditions on the erasure probability 
profile needed to have a well defined problem. From now on, 
the reader should think of the noise level as fixed to the value 
( = £map, although we abuse notation by simply writing e in 
the formulas that follow. 

It is more convenient to express (f2} with the function 
p(z) = 1— (1 — x(z)) r ~ 1 . Note that this function is interpreted 
as the erasure probability emitted by check nodes. Summariz- 
ing, the potential functional of interest is 



W\p(-)]=J cb 
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(l-l)(l-p(z))^-(l-p(z)) 



(3) 
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A word about the notation here: we use square brackets 
for functionals, i.e. "functions of functions" and usual round 
brackets for functions of a real variable. Equation (|3) can 
be expressed as a sum of two contributions W s ingio[p(-)] + 
Wint[p( - )] which are defined as follows: 

W singl c[p(-)] - [dzl (l - J)(l -p(z))^ - (1 - P (Z)) 



r I 



p(z) 1 



dzW s (p(z)), (4) 



Wi„ t [p(-)] 



dz-ip(z) 1 



dup(z + u) 



(5) 



We call © the "single system potential functional" and (O 
the "interaction functional". The following remarks explain 
the interpretation suggested by these names. The term (0 
vanishes when evaluated for a constant p{z) = p. Moreover 
the integrand of |@), namely W s {p(z)) = W s (p) is just the 
potential of the underlying uncoupled code ensemble. This 
is easily seen by recognizing that the usual density evolution 
equation for the erasure probability of checks is recovered by 
setting the derivative of W s {p) to zero. We will call W s (p) 
the "single system potential". A plot of W s (p) for the (3,6)- 
ensemble is shown as an example in Figure U when e = e MAP . 
The figure shows that the single potential vanishes at p = 
and p — p MAP , some positive value. This is a generic feature of 
all (/, r)-regular code ensembles as long as I > 3 (for cycle 
codes I = 2, we have p MA? = 0). This shows, in particular, that 
in order for the integrals in ((3} to be well defined, we have to 
consider profiles p(-) G S where 



S = {p(-) : K. — > [0,p MAP ] s.t lim zp(z) = 0, 

z— > — oo 

lim z(p(z) -p MAP ) = 0}. 
In particular the left limit is and the right limit is p M 



(6) 



B. Main Results 

It is easy to see that W[p(-)] is bounded from below, more 
precisely inf p (o e s W[p(-)] > -|pL.. Indeed Wa(p) > as 
seen in Figure [I] and using Jensen one can show Wi n t[p(")] — 
— 5-Pmap ( see Lemma R~Tt , The first non-trivial question one 
may ask is whether a minimum is attained in S. This question 
can be answered by means of the so-called "direct method", 
a standard method of calculus of variations fPJl , lfl4l . (see 
Section HV-Bl i. The next natural question is whether there is a 
unique minimizer. The functional is translation invariant i.e., 
W[p(- + a)} = W\p(-)] so it is natural to ask whether there 
is a unique minimizer up to translation. When a functional 
is strictly convex the answer is immediate. Unfortunately, 
W[p(-)] is not convex (and hence not strictly convex). However, 
and this is the main point of this paper, we will show that it 
can be represented in an alternative form so that convexity 
(and strict convexity) can be used in this alternative domain. 
This alternative notion of convexity that we will use is briefly 
explained in the next section. It will allow us to prove: 

Theorem 2.1: Let e = e MAP . The functional W[p(-)] attains 
its minimum on S. Moreover, this minimum is unique up to 
translations. 

Let us briefly outline the proof strategy of this theorem. We 
show using rearrangement inequalities [12] that the minimizers 
in S are necessarily increasing (see Lemma 14.3) . Note that 
increasing profiles can be viewed as cumulative distribution 
functions (cdfs) and in particular they have an inverse function. 
This turns out to be the right setting both for the existence 
proof (Section HV-Bb and for the application of the displace- 
ment convexity technique to prove uniqueness (Section Ml. 

III. Displacement Convexity 

Displacement convexity can be very useful in functional 
analysis. It goes back to McCann [7] and plays an important 
role in the theory of optimal transport [8 1. It has been used in 
[9 1 and [10] to study a functional governing a spatially coupled 
Curie- Weiss model [11], which bears close similarities with the 
coding theory model studied here. In this section, we give a 
quick introduction to the tool of displacement convexity. 

Recall first that the usual notion of convexity of a generic 
functional J r [p(-)] on a generic space X means that for all 

K-),p'(-) e * 

^bA(0]<AJF[KO] + (i-AW(.)], 

where p x {z) = (1 - X)p(z) + Xp'(z), A G [0, 1]. 



its inverse z(-): 



Lemma 14.31 in Section [TV] shows that we can restrict the 
minimization problem to the space of increasing profiles. Thus, 
the discussion below assumes that we consider only such 
profiles. This is the correct setting for defining displacement 
convexity. 

An increasing profile with left limit and right limit p MAP 
can be thought of as a cdf (up to scaling because the right limit 
is not 1). Further, such increasing functions have increasing 
inverse functions (which can also be thought of as cdfs, up 
to scaling). More precisely, consider the following bijective 
maps that associate (with an abuse of notation) to a cdf p(-) 



z{p) =inf{z : p(z) > p}, 
p(z) — inf{p : z(p) > z}. 



For any two increasing profiles p{-),p'{-) G S, we consider 
z(-),z'(-) their respective inverses under the maps defined 
above. Then for any A G [0, 1], the interpolated profile p\(-) 
is defined as follows: 

zx(p) = (l-\)z(p) + \z'(p), 
p\(z) = ini{p : z x > z}. 

In words, the difference in interpolation under the alternative 
structure is that the linear interpolation is applied on the inverse 
of the profiles of interest, and the effect of such an interpolation 
is then mapped back into the space of profiles. Displacement 
convexity of W[p(-)] on the space S simply means that the 
following inequality holds: 



W[px(-)}<(l-X)W[p(-)} + XW[p'(-)} 



(7) 



for any p(-),p'(-) G S. Strict displacement convexity means 
that this inequality is strict as long as p(-) and p'(-) are distinct. 
We will prove the strict displacement convexity of W[p(-)] by 
separately proving this property, in Sections |V-A| and IV-B1 for 
the two functionals (0|i and ©, respectively. 

IV. Existence of Minimizing Profile 

In this section, we prove that the functional W attains its 
minimum. 

A. Preliminaries 

We start by some preliminaries to show that one can 
restrict the search of minimizing profiles to those in S that 
are monotone increasing. The proofs of the lemmas can be 
found in Appendices lAllDl 

The first lemma states that the interaction potential is 
bounded from below. 

Lemma 4.1: For any p(-) in S, 



Az { p(z) 1 



dup{z + u)\ \ > --p l MAF . 



Proof: See Appendix lAl ■ 

We remark that any constant lower bound is sufficient 
for our purposes: finding profiles that minimize a potential 
is equivalent to find those that minimize a potential added to 
a constant. The following lemma states that a truncation of 
the profile at the value p MAP decreases the potential functional, 
so we may restrict our search of minimizing profiles to those 
with range p(z) G [0,p MAP ]. 

Lemma 4.2: Define p{z) = min{p(z), j» MAP }. For all p G S 
we have 

w[p(-)] > W-)]. 

Proof: See Appendix 151 ■ 

We next restrict our search of minimizing profiles to increasing 
ones. In order to achieve this we will use rearrangement 



inequalities. Let us first recall the notion of an increasing 
rearrangement. In words, an increasing rearrangement asso- 
ciates to any function p(-) e S with range [0,p M Ap] an 
increasing function p*(-) so that the total mass is preserved. 
More formally, any non-negative function can be represented 
in layer cake form 



p{z) 



+00 



dt±E t {z), 



where 1^ {z) is the indicator function of the level set E t = 
{z\p(z) > t}. For each t, the level set E t can be written as 
the union of a bounded set A t and a half line ]a t , +00 [. We 
define the rearranged set E% =\a t — \A t \ : +00 [. The increasing 
rearrangement of p(-) is the new function p*(-) whose level 
sets are E^. More explicitly, 

/•+00 
p*(z)= / dtt E; {z). 
Jo 

Lemma 4.3: Take any p(-) € S and let p*(-) be its 
increasing rearrangement. Then, 

W[p(-)}>W[ P *(-)}. 



B. The Direct Method 

The direct method in the calculus of variations |[T3l - lfT4l is 
a standard scheme to prove that minimizers exist. We use this 
method to obtain the following theorem: 

Theorem 4.5: The potential functional W[p(-)] attains its 
minimum in the space S', and hence in S. 

Proof: Let us take any minimizing sequence p n {-) of 
cdfs, i.e. a sequence that satisfies ([8]). By Lemma 14.41 the 
corresponding sequence of measures is tight. Thus by a simple 
version of Prokhorov's theorem for measures on the real line, 
we can extract a (point-wise) convergent subsequence of cdfs 
Pn k {) - ► Po(') as k —> +°° with Po(') £ S'. By Fatou's 
Lemma, one can check that the potential functional is lower- 
semi-continuous, which means 



W\p (-)} <]immfW\p„ k (-)]. 

k— ^-+00 



(9) 



Putting © and (O together, 

W[po(-)] < Iiminf W[pn fc (-)] < lim %»(•)] = inf W[p(-)]- 
On the other hand inf W[p(-)] < W[po(')]- Thus we conclude 
infW[p(-)] = >V[PoT)]- ■ 



Proof: See Appendix ICl ■ 

We can thus restrict the search of minimizing profiles to the 
space of increasing profiles. Furthermore because of translation 
invariance of the potential functional it is always possible to 
translate a increasing profile so that p(z) crosses the value 2^ 
at the origin. In other words infg W[p(-)] > infs' W[p(-)] with 

S' = {p(-) G S : increasing and , p{z) < —^~,z < 0, 

p(z) > —,z>0}. 

In particular, we can think of a profile as a cdf associated to 
an underlying measure (up to scaling). 

The final step of these preliminaries concerns a necessary 
condition that any minimizing sequence must satisfy. A mini- 
mizing sequence in S 1 is by definition any sequence p n {) £ S' 
such that 



lim W[ Pn (-)] = inf W[p(-)}. 

n—¥oc p£S' 



(8) 



Such a sequence exists as long as the functional is bounded 
from below; which is indeed true because of Lemma 14.11 and 
W s (p) > 0. Consider the sequence of probability measures 
associated to the sequence of cdfs p n (-). The following lemma 
states that this sequence of measures is tight. 

Lemma 4.4: Let p n (-) £ S' be a minimizing sequence of 
cdfs. For any 6 > we can find Ms > (independent of n) 
such that 

p„(Ms) -Pn(-Mg) > (1 - 6)p MAP 

for all n. 

Proof: See Appendix iDl ■ 



V. Proof of Displacement Convexity of the 
Functional 

This section contains the main result of the paper, namely 
that the potential functional W[p(-)] is strictly displacement 
convex in S' . Concretely, we prove the strict form of (£7). This 
allows to conclude that the minimizer is uniqueQ Hence we 
also conclude that the DE equations governing the system have 
a unique FP solution. 

A. Displacement Convexity of the Single-Potential Term 

We first prove that the single-potential functional 
Wsingic[p(-)] i s displacement convex. The proof is very simple. 
Note the single system potential W s (p) is not convex in the 
usual sense (see Figure U). 

Theorem 5.1: Let p(-) and p'(-) be in S' and let p\(-) the 
interpolating profile as defined in Section [HI] Then 

W sing icM-)] = (1 - A)W singlc [p(-)] + AW singlo [p'(-)]- 

Proof: Recall that W sing i e [p(-)] = LdzW s (p(z)). Recall 
also that p\(z) as defined in Section [III] is the inverse of 

z x {p) =Xz(p) + (l- \)z'{p). Thus 



dzW s {p x {z)) = / dz x (p)W s (p) 

R JO 

pPMAP pPMAP 

= (1 - A) / dz(p)W s (p) + A / dz'(p)W s (p) 
Jo Jo 

= (1 - A) / dzW s { P {z)) + A / dzW s (p'(z)). 
Jr Jr 



Thus, the function A — > Wangle [pa( - )] i s li near > hence convex. 

1 Recall that in the definition of 5' we have removed the translational degree 
of freedom. 



B. Displacement Convexity of the Interaction-Potential Term 

The study of strict displacement convexity of the interac- 
tion potential term involves a more complicated analysis. 

Theorem 5.2: Let p(-) and p'(-) be in S' and let p\(-) the 
interpolating profile as defined in Section [Hi] Then 



WintM')] < (1 - A)Wintb(-)] + AWint[p'(-)] 



(10) 



Proof: Since p can be seen as a cdf we associate with it 
its probability measure /i such that p(z) = Pmap J_ d/i(x). 
Let us rewrite the interaction functional in the form 



WintM-)] 



d^A^i) . . .d/j,x(xi)V(xi, . . -,xi), (11) 



where V(xi, . . . ,xi) is a totally symmetric "kernel function" 
that we will compute. There is an argument (see [8|) that 
allows to conclude ( fTQb whenever V is jointly convex (in 
the usual sense). Let us briefly explain this argument here. 
Consider the measures /i, p! associated to cdfs p(-), p'(-). 
Then there exists a unique increasing map T : R — > K, such 
that p! = T#p. Here T#/z is the push-forwarcQ of [i under 
T. Then from X\(jp) = (1 — X)x(p) + Xx'(p) we have that 
li\ = T A #/i where T x (x) = Xx+ (1 - A)T(cc). Equation O) 
can be written as 

W int [p x (-)] = f dnixi) . . . dp(xi)V(T x (x x ), ■ • • ,T x (xi)) 



,T x (xi)). 



l\ I dp( Xl )...dp(xi)V(T x (xi) 
ls x 



In the second equality we restrict the integrals over the 



sector S x 



{x = (xi,- ■■ ,xi) 



> Xj if i < j}, 



which is possible since V is totally symmetric. Now it is 
important to notice that since T is an increasing map we 
have T\{xi) > ■■■ > T x (xi) for any A e [0,1]. Moreover 
the A dependence in the kernel function is linear. Thus the 
proof of displacement convexity ultimately rests on checking 
that the kernel function is jointly convex in one sector, say 
S X U In f ac t the kernel function is translation invariant and can 
be expressed as a function of the distances du = x\ — x\, 
i = 1, . . . , I. We will prove joint convexity of V as a function 
of these distances. To get strict displacement convexity on 
S' we require convexity of the kernel function, and strict 
convexity as a function of d = (di2, ■ ■ ■ , du) on a non-zero 
measure subset of the d-space. 

Now it remains to compute V and to investigate its con- 
vexity. With appropriate usage of Fubini's theorem and after 
some computations, we find 



Wintb(-)] = 



C P,, 



IV 



um 




dz 



R 



n« 



n«(«-(af<-«i))H, (12) 



2 Given a measurable map T : R — Y R, the push-forward of v under 
T is the measure T#i/ such that, for any bounded continuous function 4>, 
f 4>(T(x))d»(x) = f <(,(x)d(T#v)(x). 

K K 

3 By symmetry, convexity in one sector implies convexity in other sectors. 
However this does not mean that convexity holds if arguments are taken 
in different sectors. And, indeed in the present problem one can check that 
convexity only holds within each sector. 



where 6(x) denotes the heaviside step function at x. So the 
kernel V(xi, . . . ,xf) in (fTTT i is the integrand of the first I 
integrals in (TTSI i. Our goal henceforth is to prove that V is 
convex in the usual sense. We will prove a stronger statement, 
namely that V u is convex, where u = (ui, . . . , ui), 



U 



u (x) - I dz( f[d(z - Xi) -f[0(z- (Xi - Ui)) 

We recall here that we restrict our analysis to the sector of 
the space of variables S x . Also, we remark that Yii=i @( a i) = 
9( max di). We observe that V u can be written in terms of the 

distances du = x\ — Xi, i = 2, . . . , I as (here du = 0) 



V u (x) = / dz{9(z — xi) — 6( max (z — (x 
Jr » =1 -« 



= — min (x\ — Xi + Ui) 



-1...1 



Ui)))} 

min (du + Ui) 



Lemma 1531 below states that Ki(x) is jointly convex in S x 
for all u. This implies that V(x) is jointly convex in S x . 
In Appendix [E] we give explicit formulas for V in terms 
of the distances du and prove that it is strictly convex in 
a subset of non-zero measure. Concretely, this subset is a 
small neighborhood of the origin du = 0, i = 2, . . . , I. This 
completes the proof. ■ 

Lemma 5.3: The function / u (d) = min^n + Ui) is 

i 

concave in d, where d = (d 12 , . . . , du) and du = 0. 

Proof: Let d and d' be two instances of the argument of 
/ u . Then, for A e [0, 1], 

/ u ((l - A)d+Ad') = min((l - X)d u + \d' u + u t ) 

i 

= min((l - X)(d u + Ui) + X(d'u + u t )) 

i 

> (1 — A) min(dii + Ui) + A min(d^ + Uj) 

i i 

= (1-A)/ U (d) + A/ U (d'). 
This shows concavity. ■ 

VI. Conclusion 

In this paper, we demonstrate a new tool for the analysis of 
spatially coupled codes, namely the concept of displacement 
convexity. This tool makes use of an alternative structure of 
probability distributions and hence applies to an appropriate 
space of increasing profiles. We expressed the potential func- 
tional governing the (I, r)-regular ensemble and proved that it 
indeed admits a minimizing profile in the appropriate space 
of profiles, and that it is strictly convex under the alternative 
structure. This result implies that the potential functional 
admits a unique minimizing profile, or equivalently, that the 
DE equations governing the system admit a unique FP solution. 

There are several questions that can be posed in this 
context. First, we recall that the original potential functional 
governing the system at hand is in discrete form. Can one 
extend the displacement convexity framework to the discrete 
setting? Displacement convexity can presumably be used to 
analyze a large range of problems with flavors similar to the 
present one. In particular, it can presumably be applied to 
general BMS channels, the random K-SAT and Q-coloring 
problems to name a few. We plan to come back to these 
problems in the future. 



Appendix 
A. Positivity of the Interaction Potential 

Proof of Lemma \4~J\ From Jensen's inequality, 

1 1 



du p(z + u) > I / dup(z + u)\ . 
o 



(13) 



Further, 



Ml 1 M+u 

dz / du p(z + u) = / du / dz' p(z') 

MO -M+u 



-M 



M 



M+u 



du( / dz'p(z') l + / dz'p(z') l + / dz'p(z') 1 ) 



-M+u 



-M 



M 



where (a) is obtained by first changing the order of integration 
(which is admissible since the integral converges) and then 
making the change of variable z' — z + u. And so, by 
combining this identity with ( fT3l we obtain 



M , 1 

I dz I p(z) 1 — I I du p(z + u)\ 



-M K ' 

1 -M 1 M+u 

+ f du f dz 1 p(z') 1 + I du I dz' p(z') 1 > 

-M+u M 

Now we take the limit M — > +oo for each term of this 
inequality. By an application of Lebesgue's dominated con- 
vergence theorem, the last two terms tend to zero and \p l WM ,, 
respectively. Therefore the limit of the first term is bounded 
from below by — ^p„ AP , which concludes the proof. ■ 



B. Truncation of Profiles 



Proof of Lemma WM It is easy to prove that a truncation 
of p(z) at p MAP yields a smaller value for the single system 
potential VV S (p(z)) (see e.g. the Figure U for an intuition). 
Therefore we have HWgie[p(-)] ^ Wangle [?(•)]• 

We now treat the functional corresponding to the interac- 
tion term. We define the function g as g(z) = p(z) —p(z) and 
notice that: 



p{z) < Puav =*• g{z) = and p(z) = p(z), 
p(z) > p MAr ^> g(z) > and p(z) = p MAP . 



(14) 



We need to show that Wmt[p(-)] < Wint[p(-)]' or equiva- 
lently that: 

i 
dzlp(z) 1 - I dup(z + u)) > 

E o 

i 

< J dz { (p(z) + g(z)) 1 - ( j du (p(z + u) + g(z + u)))' } 



Using the binomial expansion this is equivalent to 

f-i 



Y,Qjdzip(zyg(z) 



l-i 



B 

1 1 

dup(z + u)\ ( dug(z + u)) ^>0 

o o ' 

In the following steps, we show that the integral inside the 
summation above is positive for any fixed value of i; the 
inequality follows directly. We see that: 

l l 

I / dup(z + u)\ I / dug{z + u)\ 

o o 

l l 

< Kap ( dug(z + u)J < p l UAV / du g(z + u) 



l-i 



where the first inequality is due to the property p(z) < p MAP 
and the second is using the convexity of the function f(g) = 
9 l \ 9 > 0. We integrate over z, then make the change of 
variable z' = z + u on the right-hand side to obtain: 



l l 

dz I dup(z + u)) I dug(z + u) 
E o o 



(15) 



<J teAvJ dug(z + uy l =J dz p l MAP g(zy 

R R 



J ~.i „l -i\l- 



Using the properties of g in (fT4l . we remark that 
/ dzp l MM ,g{z) 1 ^ 1 = J dzp(z) 1 g(z) l ~ l and so the difference 

R R 

of quantities in the inequality ( |15T > is integrable, and thus we 
obtain: 

i i 

dzi / dup(z + u)\ I / dug(z + u)j < / dzp(z) % g{z) ~ l 
E o o R 

for any i. This yields the desired result Wj n t [?>(•)] > 

Wnt[p(')]- ■ 

C. Rearrangement of Profiles 

Before proceeding with the proof of Lemma 14.31 we state 
a general rearrangement inequality of Brascamp, Lieb and 
Luttinger |[T2l . 

Theorem A.1: Let fj, 1 < j < k be nonnegative measur- 
able functions on R, and let aj m , 1 < j < k, 1 < m < n, 
be real numbers. Then, if /* is the symmetric decreasing 
rearrangement of /, we have: 

/k / n \ r k / n \ 

d n x Y[fA ^2 a ]m x m J < / d n x Y[fj[ ^2 a 3 mXm 



R" 



j=l \m=l , J; 



j — 1 \ m— 1 



(16) 



Remark Theorem lA.il is nontrivial only if k > n. Otherwise, 
both integrals diverge and the inequality trivially holds. We 
will see in this section that k > n in our case. 



Proof of Lemma \4.3\ It is sufficient to prove that the 
increasing rearrangement of a profile decreases Wint[p(-)]' 
since W S mgle[p( - )] i s invariant under any rearrangement. 



Theorem IA.1I applies to symmetric decreasing rearrange- 
ments. Therefore it is convenient to first "symmetrize" the 
profile and the functional. Consider a profile p() £ S such 
that p(z) £ [0,p MAP ] (due to Lemma \4.2\ and denote by 
p(-) the function such that p{z) — p(z), z < R and 
p(z) = p(2R — z), z > R. The value R is chosen (large 
enough) so that p(R) is arbitrarily close to p MAP . Note that p(-) 
is integrable over R. 

We recall the expression of Wj„t[p(-)] in (O and rewrite 
it as: 



Wint[p(0] 



I R- 



lim 



R R 1 

dzp(z) 1 — / dz( / dup(z + u)) >. 



We now express both integrals in the bracket in terms of the 
symmetrized profile. For the first one, this is immediate 



dzp(z) 



dzp(z) 1 



(17) 



For the second one, some care has to be taken with the 
averaging over u when z is near R. One has 



dz( / du p{z + u)) 
o 

= \J dz(J dup(z + u)) l +o(±). (18) 



Replacing these two formulas in (fTTI i we have the representa- 
tion 



W int [p(-)] 



2/ 



dzp(z) 



e 

2/ 



dz( dup(z + u)) . (19) 

R JO 



Now consider p*(-), the symmetric decreasing rearrange- 
ment of p(-). The first term in (fT9l is invariant under rearrange- 
ment. It remains to prove that the second term in ([19) increases 



upon rearrangement. We express it as follows (dropping e/2Z): 

l 



dz I du p(z + u) I 
o 

dz / JJdWip(« + «i)l[ 0l i](tti) 

= / dz J Yldu', p(z' + R + u^ + -)1[_ !_!]«) 

§/ dz '/n du '' r(z ' +jR+ ^ + ^ i [-^^ ( ^ ) 

R R' l=1 

= I dz I JJduip*(z + Ui)i[o,i](ui) 

i 
dz I du p*(z + u)j , 



R R ! 



where the equality in (b) is due to the changes of variables 
z' = z — R and u^ — m — \ ; i = 1 . . . I, the inequality in (c) 
is due Theorem lA.il and the equality in (d) is obtained by first 
remarking that the indicator function lr_i ii(?4) is unchanged 
upon rearrangement and then by making the reverse changes 
of variables z = z' + R and u% = vl i + 4; i = 1 ... L So far 
we have obtained 

vMp(-)] 

1 
"§ I dzp*{z) 1 - -^ J dzU dup*(z + u) 

R R 

To obtain Wi n t[p(")] > VV; n t[p*( - )] i 1 remains to reverse the 
steps (Qj]l-([IS]l- ■ 



D. Tightness of the minimizing sequence 



Proof of Lemma \4~4\ Consider a minimizing sequence of 
cdfs p n (-), i.e. satisfying (|8). Fix any S > and suppose that 



p n (M)-p n {-M) < (l-S)p M 



(20) 



We will show that d20l ) implies that necessarily M < c/S 2 for 
a fixed constant c > 0. Taking the contrapositive we find that: 
choosing Mg = c'/^ 2 with c' > c implies that any minimizing 
sequence satisfies p n (M s ) - p n (-M s ) > (1 - S)p MAP . 



From Lemma 14. II we have 



Wb4')]>W smg i e b„(-)]-^ 



dzW s (p n (z))- 



<"Pm 



2/ ■ 
(21) 
Now, assuming (l20b there must be a mass at least 5p MA? outside 
of the interval [— M, M). Thus we have p n (— M) > Sp MAP /2 or 
Pmap - Pn(M) > <5p„ AP /2. So in [-M, M] the profile p n (z) is 
<5Pmap/2 away from the minima and p MA? . Moreover, one can 
check that W s (p) has a parabolic shape near the minima at 
and p MAP so that away from these minima W s (p) > CS 2 p 2 tA! ,/A 



for a constant C > depending only on I. These remarks 
imply 



M 



1 



M 



dzW s ( Pn (z))>-MC5 2 p, 



2^2 

MAP " 



(22) 



Since p n {-) is a minimizing sequence, for n large enough its 
cost must be smaller than the cost of a fixed reference profile, 
say p(z) — 0, z < 0, p(z) = p MAP , z > 0. More formally, 



W[p„(-)] < W[p(-)] = -- 



rP M 



J(J + l) J 

Finally, combining (ETT i. (l22l and (1231 we find that 



M < 



1-2 

AlAP 



1(1 + 1) CS 2 ' 



(23) 



(24) 



E. Explicit expressions of Kernel Function 

In this section, we compute the kernel function V of section 
IV-Bl and illustrate some of its properties. In particular we show 
that it is strictly convex in a non-zero measure set. 

Recall that the function is totally symmetric under permu- 
tations, and it is therefore enough to compute it in a fixed 
sector S x = {x = (x\,--- ,Xi) : Xi > Xj if i < j}. We 
express V in terms of the distances du, which are ordered 
such that du < dy if i < j. 

Let us first discuss the explicit examples I — 2 and I = 3. 
For I = 2 an explicit computation yields, 



V(J = 2)(dl2) = 



2 



i(l ~d 



12 J 



if d 12 > 1, 

if d 12 < 1. 



By taking the second derivative it is easy to see that Vn—2) is 
convex everywhere, and strictly convex for di2 < 1. 

For ^ = 3, we have d 12 < di3 and the computation yields, 

V(j =2 )(dl2) 

V { l=3)(dl2,d 13 )= J-I + i(l-di 2 ) 3 
+ ^(l-dl3) 4 

For di3 < 1 the Hessian is 



ifdis>l, 



if di 3 < 1. 



1-d 



12 



-2^13-1) 



"2^13 - 1 ) 2 -(di3-l)(l + di2-d 13 ) 

and the corresponding eigenvalues are: 

Ai, 2 = -{2 - 2d 13 - disdis + d? 3 ± VA}, 



where 

A = 1 + 4d? 2 - 4di 3 - 8di 2 di 3 - 4d? 2 di3 

'13 + "I2"l3 — °^13 



10d? 3 
8tti2W^3 ~i~ f*T2 13 — ^ 13 — 2wi2^*i3 ~^~ ^13" 



A plot of the eigenvalues shows that they are non-negative 
in the region < di 2 < di3 < 1. In fact, one eigenvalue 



is strictly positive everywhere in this region, and the other 
is strictly positive everywhere in this region except at the 
boundary di3 = 1, where it becomes equal to zero. This is 
consistent with the fact that Va=3)(di2,di3) = V(/ =2 )(<ii 2 ) 
when di3 > 1. For d±s > 1, the Hessian always has a 
vanishing eigenvalue, and a strictly positive one when dn < 1, 
For di2 > 1, the kernel V(i =3 )((ii 2 ,di3) is constant and both 
eigenvalues vanish. To summarize the kernel is always convex, 
and strictly convex for < di3 < 1. 

These results can be generalized for all I, We find the 
general expression of the kernel 

i i 



V l (d 12 ,---,du) = J2Yl 



k—2 m—k 



(l - d lm ) m - k + 3 

(m- jfc + 3)(m- k + 2) 
x ( £ II rf i« 

5C{2,...,m-l}ne5 

The corresponding Hessian (JTy) is a symmetric matrix of 
dimension (I — 1) X (I — 1) with matrix elements that are 
polynomials in du, i = 1, . . . , I. In particular, at dij = for all 






i we have _ff^ = 1 and iJ i? - — - — - , , ; ^~ 

" V j+1 A^77i— j + 1 (m — l)(m— 2) 

— rrr; j > i- Defining v as the (I — 1) -dimensional vector of 
l's and denoting by 1 the (I — 1) -dimensional identity matrix, 
we remark that H at the origin can be expressed as 



H = (l 



1 



l-l 



)1- 



1 



l-l 



The eigenvalues of this matrix are 1 + j^-y and 1 (with 1 + j^y 
having degeneracy I — 2). Since these eigenvalues are strictly 
positive, the Hessian is strictly positive definite at the origin, 
and thus (by continuity) also in a small neighborhood of the 
origin. Thus V is a strictly convex function of du, i = 2, . . . , I 
in a small neighborhood of the origin. 
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